speculative : fix seg fault in certain cases #12454

ggerganov · 2025-03-18T16:12:53Z

Fixes a crash when using tree-based speculative decoding with greedy sampling:

make -j llama-speculative && ./bin/llama-speculative -m ../models/qwen2.5-32b-coder-instruct/ggml-model-q4_0.gguf -md ../models/qwen2.5-0.5b-coder-instruct/ggml-model-q4_0.gguf --ctx-size 0 -ub 4096 -b 4096 -ngl 99 -ngld 99 -fa --draft-max 8 --draft-min 0 --draft-p-min 0.75 -c 4096 --color -p "Write a quicksort algorithm" -np 4 --top-k 1 -s 1

speculative : fix seg fault in certain cases

85658e9

github-actions bot added the examples label Mar 18, 2025

ericcurtin approved these changes Mar 18, 2025


ggerganov merged commit c6af216 into master Mar 18, 2025
47 checks passed

ggerganov deleted the gg/speculative-fix branch March 18, 2025 17:35
